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ABSTRACT 

This article presents a summary of some research on a 
statistical approach to identifying potentially successful and 
dropout students in a college population. The technique utilized is 
Multiple Discriminant Analysis (MDA). MDA eliminates the shortcomings 
of regression-correlation and uses only quantifiable information. The 
2 variables that proved significant in the 3 tests were (1) the 
Social Consciousness Score test (administered by the Educational 
Testing Service) and (2) the percentage of college costs financed by 
grants. It was shown that successful students have a greater concern 
for social issues which is reflected in their greater persistance in 
college; also that successful students have a higher percentage of 
grants than unsuccessful students. High School Rank proved 
significant, while High School Average did not. Conclusions include 
that (1) MDA has application (69-8790 successful in study) for the 
college population as a whole; (2) loans are not necessarily the most 
effective way to finance higher education; and (3) the entire area of 
useful indicators of college student success should be reevaluated. 
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INTRODUCTION 



The process of selecting and evaluating High School students for the 
scarce openings in our Colleges goes on continuously. The process may be 
informal, interviews and personal judgment being the major ingredients; or 
it may be a highly structured mathematical process, most often using 
regression-correlation techniques. Neither method has been distinguished 
by its success. 

The shortcomings of the subjective approach are obvious: too often 

the interview is hurried and the important indicators of possible success 
in college are not uncovered. The structured research has most often been 
an attempt to predict future quality point averages using prior data. 

The reasons for the lack of previous success in the structured re- 
search are twofold: i. regression-correlation is only suitable when the 

prior data is adequately separated; and 2. the premise upon which the 
technique is based is not suitable: that is, that it is possible to rank 

students in a significant manner to predict future success or failure. 

Rather than ranking students on a scale, the emphasis should be on 
identifying potentially successful or dropout students. The technique 
ideally suited for this is Multiple Discriminant Analysis (MDA). 

Multiple Discriminant Analysis (MDA) is a technique which eliminates 
the shortcomings of regression-correlation techniques. First, MDA places 
a dependent variable in a category rathe r/ than in a relative position 
and thus eliminates the problem of the independent variates being bunched 
too closely for regression techniques to rank them adequately. Secondly, 
since MDA uses only quantifiable information, personal interviewing 
techniques must be limited to factual data thus avoiding the gray area of 
subjective information. 

MDA has proven successful in a variety of research efforts; the 
author used the technique to separate successful and unsuccessful students. 
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The research took place at Utica College, a branch of Syracuse University. 

METHOD 

A quantifiable student profile was the first objective of the /study. A total 
of eight variables were chosen for the initial discriminant test and/ they are listed 
below. 

TABLE I 



Variable 



Variables initially chosen 

Source 



College Student Questionnaire (given to 
all fresimien at Utica College) 
Educational Testing Service (taken by 
students in High School) 

Educational Testing Service (taken by 
students in High School) 

Records at Utica College 



Social Consciousness Score 
Scholastic Aptitude Test - Verbal 
Scholastic Aptitude Test - MAth 
College First Semester Grade Pt. Avg. 

High School Average " " " " 

High School Rank (Quintile) " " " " 

Percentage of College Costs 

Financed by Loans " " " " 

Percentage of College Costs 

Financed by Grants " " " " 

The Social Consciousness Score ranges from V: - 40. The Social Consciousness 
Score is administered by the Educational Testing Service. The test has a series of 
questions designed to indicate whether a student has an interest in matters of social 
concern. There is no attempt to imcover a student’s political persuasions but rather 
whether he has an interest in such matters. High -scores indicate that a student is 
aware of social problems (poverty, racial difficulties) and is concerned about them. 
Low scores indicate a lack of concern for such problems. In no way does it indicate 
political or social bias. The variable is particularly suited for MDA because it is 
the only quantifiable score relating a student's interest in the world around him. 

The Scholastic Aptitude Tests (Verbal and Mathematics) are two of the primary 
indicators that admission counselors use to decide on the acceptance of students. 

They were inserted in the model to test whether they should warrant such importance. 
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Previous studies in predicting college dropouts indicate that first 

semester grade point average is the most significant indicator in a stu- 

1 

dent's profile. The model will test this premise. 

The High School Rank and High School Average are also common criteria 

used to admit students into college. How effective they are in identifying 

future dropouts has never been studied. 

The percentage of college costs financed by loans and the percentage 

of costs financed by grants are the two economic variables in the study. 

They are there to find the best method of financing an individual's 

education. Most experts in the field have advocated loans as the cheapest 

2 

and most effective way of keeping students in college. The model will 
test this assumption. 

Since all the variables are quantifiable they are acceptable in an 

3 

MDA program. The mathematics of the technique is listed below. 

Mathematics of MDA 



Assume that there is a set of scores for each group; which linear 
combination of scores will best discriminate between the groups? 

Assume that there are N normally distributed observations on p 
variables Xj_ which are denoted by Xit U = i, 2 . . . p; t =1, 2 . . . N). 



i 

I 

I 1 

[ For a comprehensive view of studies on student characteristics, see 

j John R. Hills, "Admissions Procedures that Make Sense", The American 

1 College , (Nev; York: Wiley and Sons, 1962), pp. 16 - 24. 

! 2 

Both Seymour Harris and Otto Eckstein have advocated loans as a 
means of retaining students in college. See Seymour Harris, editor, 
Higher Education in the United States . .Cambridge: Harvard University 

Pre s s , i960} , p. 9. 

The mathematics was developed by R. A. Fisher in "The Use of 
Multiple Measurements in Taxonomic Problems", Annals of Eugenics , 

Q no. 7. 
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Classify these into two groups for t = i, 2 . . . Nj_ and t = Nj_ ..f i, 

Nj_ 7 - 2, . . . Nj_ -t- N 2 = N. The means in each group are defined by: 

U) , X« = >. (i = J., 2 . . . p) 

C-r/ ^ r.-, .y /j; A/^ 

Let the difference of the means be: 

(2) di = X}* - (i = 1, 2 . . . p) 

The object is to find the linear function of the difference of the 
me ans : 

(3) Z = kj^d l< 2 d 2 

f 

I 

which best discriminates between the two jsets of variables. Its 
square should be a maximum relative to its variance. The variance is 
proportional to: 

(4) Q = ^ ■'ikjSij 

where k = coefficient of each variable and S = each 
variable in the model. 

In order to maximize (3) under the condition that the variance (4) 

is a constant, the following function is formed: 

P 



(5) F = z 2 - AQ = A ^ k^kjdidj - A kikjSij 

v\/here A is a Lagrange 

multiplier. 

Expression (b) is differentiated partially with respect to 
kj (j = 1, 2, . . . p) and the following set of equations is obtained: 



(6) d. 2_ kAdA = A L. k.S4 . ( J = 1, 2, . . . p) 

t-i ^-3 

To simplify letA = ^^kidj_ and the following is obtained: 
^ilkl f Sj_2k2 f . . . ^ ^ipkp =* dp 
^12^1 -f ^22^2 1^ . . . f ^2pkp = d2 



"iph 



®ip''2 r . . • y- Sppkp - dp 



The solutions are proportional to the estimates of the coefficients 
of the linear function which in the population corresponding to the sample 
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discriminates best between the groups. 

A test of slgnifioanoe is also Included in the solutions. The test uses the 
F distribution and the generalised student distribution in the follovdng manner: 

j NiN2(kidi ... kpdp) 

R - ij 

then the variance ratio: 

r = liL- P - 

p(1 - R^) 

is the F distribution for m = p and nj = N - p - 1 degrees of freedom. In this way 
the hypothesis may be tested that the disorlmlnant function was a chance occurrence. 

Results of Identification Model 

In all three tests the students from both categories (dropouts and successful 
students) were chosen randomly from the files of Utica College. After the relevant 
information was obtained for each student, the entire group was analysed by the MDA 
model. The model then identified successful and dropout students. The results of 

such identification are listed below. 

Table II 



Variables and F Test Results for Three Discriminant Tests 



Variables 



Test 1 Test 2 



1. Social Consciousness Score 

2. Percentage of College Costs 
Financed by Grants 

3. First semester grade point 
average 

4. High School Rank (Quintile) 

5. Scholastic Aptitude Test 
Verbal Score 

6. Scholastic Aptitude Test 
Mathematics Score 

7. Percentage of College Costs 
Financed by Loans 

8. High School Average 



B„101^ 

16 . 924 ^‘ 

20 . 967 ^^ 

1.358 

0.671 

2.259 

1.799 



15.200«‘' 

14.240^^ 

36.842^^ 

17.166^^ 



0.488 



Test 3 
16.321^^ 

13.124^^ 

51 .768^ 
25 . 839 ^' 



} 
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Table II - continued 
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Variables Test 1 

Number and Percentage of Active 
Students correctly identified 25(69.4%) 

Number and Percentage of Inactive 
Students correctly identified 29(78.3%) 

•^•Significant at .05 level 



Test 2 Test 3 



32 ( 80 . 0 %) 33 ( 86 . 0 %) 

35 ( 87 . 5 %) 32 ( 82 . 0 %) 



The students were drav^n randomly, a different group for each test, 
from the files of Utica College. 

The Social Consciousness Score proved significanx in all three tests. 
This means that one can identify by this variable a dropout or successful 
student and that there is significant difference in social awareness be- 
tween students who stay in college and those who drop out. The successful 
students have greater concern for social issues and this is reflected in 
their greater persistence in college. 

The percentage of college costs financed by grants was also signifi- 
cant in all three tests. Thus, successful students have higher percentage 
of grants than unsuccessful students. One is tempted to assume that the 
result follows from the fact that orants are given to more successful 

students. This was not proven, however, by subsequent correlation tests 

4 

run on all eight variables. 

ii more likely explanation is that the awarding of a grant provides 
a degree of security to a student, thus providing more incentive to 
remain in college. 

The fact that the percentage of college costs financed by loans was 

5 

not significant disputes the hypothesis set down by Harris and Eckstein. 



4 

The correlation coefficient between the High School Average and 
% of Grants was .261. Between the First Semester Grade Point Average 
the Percentage of Grants was only .291. 

5 
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The awarding of a loan is often looked upon unfavorably by the recipient. The worry 
incurred over the rising debt is also a problem for such students. 

Of the two high school indicators, the High School Rank proved significant 
while the High School Average was not significant. The rank is biased in that a good 
student in stiff competition may have a low rank while a poor'^r student may rank hxgh xn 
a less competitive situation. The High School average should account for this, however, 

but this proved not to be significant in the testing. 

The first semester college grade point average was, expectedly, significant 
in all three tests. There is nothing like a good academic beginning to give a stu- 
dent confidence. 

The two aptitude tests. Verbal and Mathematics, were surprisingly not signi- 
ficant. This could be due to the lack of separation of the scores (most students 
at Utica College range from 430 - 4^0 in scores) or to the fact that at the marginal 
schools such as Utica, the score is not as useful as at other more prestigious schools. 
Whichever the reason, the aptitude tests wer(3 not a factor in identifying college 

dropouts. 

Although one should not generalize for the entire college population, the 
results of the study at Utica College show dearly that the faith of admissions coun- 
selors in High School averages and the Aptitude Tests is open to question. Lxkewxse, 
the economic assumption discussed earlier, that loans are the most efficient method 
of financing higher education expenses, is not verified by the study at Utica College. 

The MDA model has clearly shown that grants are identified with successful 
students. Loans, moreover, are not necessarily associated with dropouts, but are 

not significantly associated with either group. 

Conclusions 

Three significant conclusions result from the research on dropouts at Utica 
College. One is that the statistical technique. Multiple Discrxminant Analysis, 
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proved highly successful (69 - 87^) in identifying dropouts at the college. This is 
remarkable in view of the rather poor results that have been recorded in previous 
correlation tests. 

While the analysis was only applied to the Utica College population there is 
application for the college population as a whole. The method is certainly applicable 
— the variables may be chosen to suit the particular college. The important point 
is to uncover the relevant variables for the particular college. MDA is a perfect 
tool for such analysis. 

Secondly, a n-umber of economic policy conclusions emerge from the results, 
the most important of which is that loans are not necessarily the most effective 
way to finance higher education. 

Thirdly, the entire area of useful indicators of student success in college 
should be reevaluated. There is some question as to whether High School Grades 
and Achievements Tests as criteria deserve the attention they get from Admissions 
Counselors. 

Future research should be directed toward clarifying the issues raised here. 
Multiple Discriminant Analysis is an ideal method for such research. 



^Studies conducted by the Dean's Office at Utica College yielded correlation 
coefficients around .08. This means that only .06^ of the variance v/as explained 
by the independent variable. 
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